Safe Functional Inference for Uncharacterized Viral Proteins

Authors

  • Yaniv Loewenstein
  • Michal Linial
Abstract

blue represent proteins in homologous families A and B, while green and black denote other families C and D. Reported BLAST similarities are depicted by curved edges (bottom). A and C coincide on a multi-domain protein (pink and green protein), which may induce false transitivity: A is falsely clustered with the nonhomologous C because of the local BLAST similarities of the multi-domain protein (red edges). Correct merging of A and B is aided by an unassigned protein (white).
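The false-transitivity effect described in the caption can be illustrated with a small sketch. It is not the authors' method: it treats each reported BLAST similarity as an undirected edge and applies plain single-linkage clustering (connected components). All protein names (`a1`, `white`, `ac_multi`, and so on) and the hit lists are hypothetical toy data chosen to mirror the families A, B, C and D in the caption.

```python
from collections import defaultdict


def connected_components(nodes, edges):
    """Single-linkage clustering: proteins linked by any chain of hits share a cluster."""
    adjacency = defaultdict(set)
    for u, v in edges:
        adjacency[u].add(v)
        adjacency[v].add(u)

    seen = set()
    clusters = []
    for start in nodes:
        if start in seen:
            continue
        # Depth-first traversal collecting everything reachable from `start`.
        stack = [start]
        component = set()
        while stack:
            node = stack.pop()
            if node in component:
                continue
            component.add(node)
            stack.extend(adjacency[node] - component)
        seen |= component
        clusters.append(frozenset(component))
    return clusters


def cluster_of(clusters, protein):
    """Return the cluster that contains `protein`."""
    return next(c for c in clusters if protein in c)


# Hypothetical toy data mirroring the caption.
proteins = ["a1", "a2", "b1", "b2", "white", "ac_multi", "c1", "c2", "d1", "d2"]

# Similarities within each family.
family_hits = [("a1", "a2"), ("b1", "b2"), ("c1", "c2"), ("d1", "d2")]

# The unassigned (white) protein is similar to members of both A and B.
bridge_hits = [("a2", "white"), ("white", "b1")]

# The multi-domain protein has local hits to A and to C (the red edges).
multi_domain_hits = [("a1", "ac_multi"), ("ac_multi", "c1")]

naive = connected_components(
    proteins, family_hits + bridge_hits + multi_domain_hits
)
without_multi = connected_components(proteins, family_hits + bridge_hits)

print(sorted(cluster_of(naive, "a1")))
print(sorted(cluster_of(without_multi, "a1")))
```

With all edges included, A, B and C end up in one cluster, so the nonhomologous family C is pulled in through the multi-domain protein. Dropping that protein's local hits leaves A and B merged through the white protein while C stays separate. Discarding edges this way is only a device for showing the effect, not a recommended filtering rule.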


Related articles

A predictor for toxin-like proteins exposes cell modulator candidates within viral genomes

MOTIVATION: Animal toxins operate by binding to receptors and ion channels. These proteins are short and vary in sequence, structure and function. Sporadic discoveries have also revealed endogenous toxin-like proteins in non-venomous organisms. Viral proteins are the largest group of quickly evolving proteomes. We tested the hypothesis that toxin-like proteins exist in viruses and that they act ...


Network-based inference of protein function and disease-gene association

Protein interactions are essential to many aspects of cellular function. On the one hand, they reflect direct functional relationships, i.e., if two proteins interact with each other they are often involved in the same biological process or pathway. On the other hand, alterations in protein interactions, e.g., caused by mutations in their interfaces, perturb natural cellular processes and contr...


Network-based function prediction and interactomics: the case for metabolic enzymes.

As sequencing technologies increase in power, determining the functions of unknown proteins encoded by the DNA sequences so produced becomes a major challenge. Functional annotation is commonly done on the basis of amino-acid sequence similarity alone. Long after sequence similarity becomes undetectable by pair-wise comparison, profile-based identification of homologs can often succeed due to t...


Proteins of Unknown Function in the Protein Data Bank (PDB): An Inventory of True Uncharacterized Proteins and Computational Tools for Their Analysis

Proteins of uncharacterized functions form a large part of many of the currently available biological databases and this situation exists even in the Protein Data Bank (PDB). Our analysis of recent PDB data revealed that only 42.53% of PDB entries (1084 coordinate files) that were categorized under "unknown function" are true examples of proteins of unknown function at this point in time. The r...


Family Classification and Integrative Analysis for Protein Functional Annotation

The high-throughput genome projects have resulted in a rapid accumulation of predicted protein sequences, however, experimentally-verified information on protein function lags far behind. The common approach to inferring function of uncharacterized proteins based on sequence similarity to annotated proteins in sequence databases often results in over-identification, underidentification, or even...




Publication date: 2008